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Abstract. We study the large deviations of excited random walks on Z. We prove a large deviation 
principle for both the hitting times and the position of the random walk and give a qualitative 
description of the respective rate functions. When the excited random walk is transient with 
positive speed vo, then the large deviation rate function for the position of the excited random 
walk is zero on the interval [0, vo] and so probabilities such as P{X n < nv) for v G (0,t>o) decay 
subexponentially. We show that rate of decay for such slowdown probabilities is polynomial of the 
order n 1-l5/ ' 2 , where S > 2 is the expected total drift per site of the cookie environment. 



In this paper we study the large deviations for one-dimensional excited random walks. Excited 
random walks are a model for a self-interacting random walk, where the transition probabilities 
depend on the number of prior visits of the random walk to the current site. The most general 
model for excited random walks on Z is the following. Let Q = [0, l] ZxN ; and for any element 
co = {uii(j)}i£Z, j>i G ^ we can define an excited random walk X n by letting ix>i(j) be the probability 
that the random walk moves to the right upon its j-th visit to the site i 6 Z. More formally, we 
will let Pu(X = 0) and 

P u (X n+ i = x + l\X n = x) = 1 - P w (A" n+ i = x - l\X n = x) = uo x (#{k <n: X k = x}) . 

Note that the excited random walk X n is not a Markov chain since the transition probabilities 
depend on the entire past of the random walk and not just the current location. 

Excited random walks are also sometimes called cookie random walks, since one imagines a stack 
of "cookies" at every site which each induce a specific bias to the walker. When the walker visits 
the site x for the i-th time, he eats the i-th cookie which causes his next step to be as a simple 
random walk with parameter uj x (i). For this reason we will also refer to cu = {wi(j)}iez, j>i as a 
cookie environment. 

We can also assume that the cookie environment to is first chosen randomly. That is, let P be a 
probability distribution on the space of cookie environments f2, and define a new measure on the 
space of random walk paths Z z+ by averaging over all cookie environments. That is, let 



Jn 

For a fixed cookie environment u>, the law P^ is referred to as the quenched law of the excited 
random walk, and P is called the averaged law of the excited random walk. 
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Most of the results for excited random walks make the assumption that there are only finitely 
many cookies per site. That is, there exists an M such that uJi(j) = 1/2 for any i € Z and j > M 
so that after M visits to any site the transitions are like a simple symmetric random walk. 

Assumption 1. There exists an integer M < oo such that there are almost surely only M cookies 
per site. That is, P(f^M ) = 1, where 

n M = nn{u: = 1/2, v« e z, vj > m}. 

We will also make the common assumption that the cookie environment is i.i.d. in the following 
sense. 

Assumption 2. The distribution P is such that the sequence of cookie environments at each site 
is i.i.d. 

Finally, we will make the following non-degeneracy assumption on cookie environments. 
Assumption 3. With M as in Assumption^ 



Excited random walks were first studied by Benjamini and Wilson in [BW03J, where they con- 
sidered the case of deterministic cookie environments with one cookie per site (that is M = 1). The 
focus of Benjamini and Wilson was mainly on the Z rf case, but they showed that excited random 
walks with one cookie per site are always recurrent. The model was further generalized by Zerner 
in [Zer05j to allow for multiple cookies per site and for randomness in the cookie environment, but 
with the restriction that all cookies induced a non- negative drift (that is uJi(j) > 1/2). Recently 
the model of excited random walks was further generalized by Zerner and Kosygina to allow for 
cookies with both positive and negative drifts [KZ08]. 

The recurrence/transience and limiting speed for one-dimensional excited random walks have 
been studied in depth under the above assumptions. A critical parameter for describing the behavior 
of the excited random walk is the expected total drift per site 

V M 1 



Zerner showed in |Zer05| that excited random walks with all cookies uJi(j) > 1/2 are transient to 
+oo if and only if 5 > 1. Additionally, Zerner showed that the limiting speed vq = linin^oo X n /n 
exists, P-a.s., but wasn't able to determine when the speed is non-zero. Basdevant and Singh 
solved this problem in [BS07J where they showed that vo > if and only if 5 > 2. These results for 
recurrence/transience and the limiting speed were given only for cookies with non- negative drift but 
were recently generalized by Kosygina and Zerner [KZ08J to the general model we described above 
that allows for cookies with both positive and negative drifts. In summary, under Assumptions [T] 
[3j the following results are known. 

• X n is recurrent if and only if 5 £ [—1, 1]. Moreover, 



M M 



E YlMJ) > and E JJ(l-woO")) > °- 




(1) 





LARGE DEVIATIONS AND SLOWDOWNS 



3 



• There exists a constant vq such that lim„_ ! . 00 X n /n = vq, P-a.s. Moreover, vq = if and 
only if Se [-2,2]. 

Limiting distributions for excited random walks are also known with type of rescaling and limiting 
distribution depend only on the parameter 6 given in 0. The interested reader is referred to the 
papers I5S08. KZ08, KM11 | iDollll |DK12| for more information on limiting distributions. 



1.1. Main Results. In this paper, we are primarily concerned with the large deviations of excited 
random walks. In a similar manner to the approach used for large deviations of random walks 
in random environments, we deduce a large deviation principle for X n /n from a large deviation 
principle for T n /n, where 

T n = ml{k > : X k = n}, n£Z 
are the hitting times of the excited random walk. However, we don't prove a large deviation principle 
for the hitting times directly. Instead, we use an associted branching process with migration Vi 
that has been used previously in some of the above mentioned papers on the speed and limiting 
distributions for excited random walks [BS07, KZ08, KMllJ. We prove a large deviation principle 
for n _1 Yl7=i an d use this to deduce a large deviation principle for T n /n which in turn implies 
the following large deviation principle for X n /n. 

Theorem 1.1. The empirical speed of the excited random walk X n /n satisfies a large deviation 
principle with rate function Ix defined in ( |28[ ). That is, for any open set G C [—1,1], 

(2) liminf - log P(X n /n G G) > - inf I x (x), 

n— >oc n x£G 

and for any closed set F C [—1,1], 

limsup — log P(X n /n G F) < — inf Ix{x). 

n-s>oo Tl zG-F 

Remark 1.2. After the initial draft of this paper was completed, it was noted that a general large 
deviation principle for certain non-Markovian random walks due to Rassoul-Agha [RA04] can be 
used to prove Theorem |1.1| in certain cases. Thus, it is necessary to point out some of the differences 
with the current paper. 



In [RA04] the random walks are assumed to be uniformly elliptic, which in the context of 
this paper would require 0Ji(j) G [c, 1 — c] for all i G Z, j > 1 and some c > 0. In contrast, 
we only assume the weaker condition in Assumption [3} 

The results of [RA04] only apply directly to excited random walks with deterministic 
cookie environments. If the cookie environments are allowed to be random and satisfying 
Assumption [2j then a technical difficulty arises in satisfying one of the conditions for the 
large deviation principle in [RA04J. Specifically, the transition probabilities q(w, z) for the 
shifted paths as defined in |RA04| do not appear to be continuous in w for the required 
topology. We suspect, however, that the techniques of |RA04j could be adapted to apply 
to this case as well. 

The formulation of the large deviation rate function in [R A04| is difficult to work with 
and the only stated properties of the rate function are convexity and a description of the 
zero set. In contrast, our method gives a more detailed description of the rate function 
(see Lemma 5.1 and Figure [3]). 

The method in RAO lj does not also give a large deviation principle for the hitting times 
of the random walk. 
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As mentioned in the above remark, the formulation of the rate function Ix given in the proof of 
Theorem 1 1 . 1 1 allows us to give a good qualitative description of the rate function (see Lemma 5.1) 



One particularly interesting property is that when 5 > 2 (so that the limiting speed vq > 0) then 
Ix{x) = when x 6 [0, vq\. Thus, probabilities of the form P(X n < nx) decay subexponentially if 
x & (0, Vq). In fact, as the following example shows, one can see quite easily that such slowdown 
probabilities must have a subexponential rate of decay. 

Example 1.1. We exhibit a naive strategy for obtaining a slowdown of the excited random walk. 
Consider the event where the excited random walk first follows a deterministic path that visits 
every site in [0, n 1//3 ) M times (so that no cookies remain in the interval) and then the random 
walk stays in the interval [0, n 1//3 ) for n steps. The probabilistic cost of forcing the random walk to 
follow the deterministic path at the beginning is e - c ' Mnl/i for some d > 0. Then, since there are 
no cookies left in the interval, the probability of then staying in [0, n 1 / 3 ) for n steps before exiting 
to the right is a small deviation computation for a simple symmetric random walk. The probability 

of this event can be bounded below by Ce~ c " nXlZ for some C,c" > (see Theorem 3 in |Mog74| ) . 

173 

Thus, the total probability of the above event for the excited random walk is at least Ce~ cn 

The example above shows that P(X n < xn) decays slower than a stretched exponential. However, 
this strategy turns out to be far from the optimal way for obtaining such a slowdown. The second 
main result of this paper is that the true rate of decay for slowdowns is instead polynomial of the 
order n 1_<5 / 2 . 

Theorem 1.3. If 5 > 2, then 

(3) lim - = 1 ~o' VxG(0,u ), 

n->-oo log n 2 

and 

,. logP(T n >nt) , 6 u 
4 lim ; n = 1 - -, Vt > 1 v . 

n->oo log n 2 

1.2. Comparison with RWRE. Many of the prior results for one-dimensional excited random 
walks are very similar to the corresponding statements for random walks in random environments 
(RWRE). For instance, both models can exhibit transience with sublinear speed and they have 
the same types limiting distributions for the hitting times and the location of the random walk 
[KZ08, Sol75, KKS75J. Thus, it is interesting to compare the results of this paper with what is 
known for one-dimensional RWRE. 

Large deviations for one-dimensional RWRE (including a qualitative description of the rate 
functions) were studied in [CGZOOj and subexponential slowdown asymptotics for ballistic RWRE 



similar to Theorem 1.3 were studied in jDPZ96j . The similarities to the current paper are greatest 
when the excited random walk has 5 > 2 and the RWRE is transient with positive speed and 
"nestling" (i.e., the environment has positive and negative drifts). In this case, the large deviation 
rate function for either model is zero on the interval [0,t>o], where vo = lim n _ >00 X n /n is the 
limiting speed. Moreover, the polynomial rates of decay of the slowdown probabilities are related 
to the limiting distributions of the random walks in the same way. For instance, in either model 
if the slowdown probabilities decay like n 1_a with a G (1,2) then n~ l / a (X n — uvq) converges in 
distribution to an a-stable random variable [KZ08J IKKS75j . 

An interesting difference in the rate functions for excited random walks and RWRE is that 
/UO) = in the present paper, while for transient RWRE the left and right derivatives of the rate 
function are not equal at the origin [CGZOOj. Since (in both models) Ix is defined in terms of 
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the large deviation rate function ir(£) f° r the hitting times T n /n, this is related to the fact that 



inff = for excited random walks (see Lemma 4.1) while the corresponding rate function for 



the hitting times of RWRE is uniformly bounded away from if the walk is transient to the left. 

1.3. Outline. The structure of the paper is as follows. In Section [2] we define the associated 
branching process with migration Vi, mention its relationship to the hitting times of the excited 
random walk, and prove a few basic properties about the process Vi. Then in Section [3] we prove a 
large deviation principle for the empirical mean of the process Vi and prove some properties of the 
corresponding rate function. The large deviation principle for the empirical mean of the process Vi 
is then used to deduce large deviation principles for T n /n and X n /n in Sections [4] and [5} respectively. 
Finally, in Section [6] we prove the subexponential rate of decay for slowdown probabilities. 

2. A RELATED BRANCHING PROCESS WITH RANDOM MIGRATION 

In this section we recall how the hitting times T n of the excited random walk can be related 
to a branching process with migration. We will construct the related branching process with 
migration using the "coin tossing" construction that was given in [KZ08j . Let a cookie environment 
co = {wi(j)}iez,j>i be fixed, and let {£i,j}iez,j>i be an independent family of Bernoulli random 
variables with P = 1) = coi(j). For i fixed, we say that the j-th Bernoulli trial is a "success" if 

£ij = 1 and a "failure" otherwise. Then, let Fm be the number of failures in the sequence {Ci,j}j>i 
before the m-th success. That is, 

r i } 

F$ = min < t > 1 : £y = m > — m. 

I J =1 J 

Finally, we define the branching process with migration { V^}^>i by 

Vb = 0, and V i+1 = F$ +1 , for i > 0. 

If the uJi(j) were all equal to 1/2 then the process {V^} would be a critical Galton- Watson branching 
process with one additional immigrant per generation. Allowing the first M cookie strengths at 
each site to be different than 1 /2 has the effect of making the migration effect more complicated (in 
particular, the migration in each generation is random and can depend on the current population 
size). We refer the interested reader to |BS07| for a more detailed description of the interpretation 
of Vi as a branching process with migration. 

In addition to the above branching process with migration, we will also need another branching 
process with a random initial population and one less migrant each generation. For any n > 1, let 
Vq = V n where V n is constructed as above and let V^ = F^ n ^? , where we let F(p = 0. Note 

i— 1 

(n) 

that with this construction, we have that V { < V n +i for all i. Moreover, while the Markov chain 

(n) 

Vi is irreducible, the lack of the extra migrant each generation makes an absorbing state for V^ . 

The relevance of the processes {Vj}j>o and {V^}i>o to the hitting times T n of the excited 
random walk is the following. 

n oo 

(5) T B £n + 2j> + 2j>/ n >. 

i=i i=i 
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To explain this relation let f7" = #{k < T n : X k = i, X k +i = i — 1} be the number of times the 
random walk jumps from i to i — 1 before time T n . Then, it is easy to see that T n = n + 2 J2i<n 
and ([5]) follows from the fact that 

(6) (fC . . . [/?, Etf , L^, f/™ 2 , . . .) = (Vi, V 2 , . . . , K-i, V n , V} n) ,vt\ ■ ■ •)• 

The details the above joint equality in distribution can be found in |BS07j or [KMllj, 

Remark 2.1. Technically, the relation [5] is proved in |BS07j and |KM11| only in the cases where 
T m < 00 with probability one. However, an examination of the proof shows that P(T n = k) = 

P(n + Y^l=i V + 2 Yli=i = k) ^ or an y finite k and so both sides of ^ are infinite with the 
same probability as well. 



2.1. Regeneration structure. We now define a sequence of regeneration times for the branching 
process Vi. Let co = and for k > 1 

a k = mi{i > o k _ x : Vi = 0}. 

Also, for k > 1 let 

i=i 

be the total offspring of the branching process by the k th regeneration time. The tails of o\ and 
W\ were analyzed in [KMllj in the case when 5 > 0. 

Lemma 2.2 (Theorems 2.1 and 2.2 in [KMllj ). If S > then, 

(7) P(oi > x) ~ Cix-" 5 and P(Wi > x) ~ C 2 a;- ,5/2 as a; -> 00. 



Note that if the Markov chain Vi is transient, then eventually at = Wt = 00 for all k large 
enough. The following Lemma specifies the recurrence/transience properties of the Markov chain 
Vi. 

Lemma 2.3. The Markov chain Vi is recurrent if and only if 5 > and positive recurrent if and 
only if 5 > 1. 



Proof. The tail decay of <j\ shows that i5[<7i] < 00 if 5 > 1 and £7[oi] = 00 if 5 G (0,1]. Therefore, 
it is enough to show that Vi is recurrent if and only if 5 > 0. This can be proven by an appeal 
to some previous results on branching proceses with migration as was done in [KZ08j. A small 
difficulty arises in that the distribution of the migration that occurs before the generation of the 
(i + l)-st generation depends on the population of i-th generation. However, this can be dealt with 
in the same manner as was done in |KZ08j . To see this, let V, be defined by 

V = 0, and V i+1 = . . 

u ' + (Vi+l)VAf 

Note that Vi and Vi have the same transition probabilities when starting from a site k > M — 1, 
and thus Vi and Vi are either both recurrent or both transient. 

Next, let Zi = Vi+i — FjJ. We claim that Zi is recurrent if and only if Vi is also recurrent. 
Since < Zi < V%+x, Zi is recurrent if Vi is recurrent. To see the other implication, note that 
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Zi = s — Fw is the number of failures in ff» 9 },>i between the M-th success and success 

number (V£ + 1) V M. Therefore, Z{ is independent of Fj^ . Since Fj$ is an i.i.d. sequence, then 

"£p(v i+1 = o)>Y j p(z 1 = o, fS = o)=p (f$ = o) = 0), 

and thus Vi is recurrent if Zi is recurrent. 

Finally, it can be shown that Zi is a branching process with migration where the migration compo- 
nent has mean 1 — 5 and the branching component has offspring distribution that is Geometric(l/2) 
(see Lemmas 16 and 17 in [KZ08J). Then, previous results in the branching process with migration 
literature show that Zi is recurrent if and only if 5 > (see Theorem A and Corollary 4 in [KZ08J 
for a summary of these results). □ 

We close this section by noting that the above regeneration structure for the process Vi can be 
used to give a representation for the limiting speed of the excited random walk. First note that, as 
was shown in |BS07| . the representation ^ can be used to show that when 5 > 1, 



1 .. T, 



lim 



n 



1 n 

1 + 2 lim - > Vi. 

n. — ^oo n — J 



Vn n—>oo n n— >oo n 

1=1 

To compute the last limit above, first note that {(W^ — Wk-i,<7k ~ cr fc-i)}fc>i is an i.i.d. sequence 
and that the tail decay of a± given in Theorem 2.1 of |KMllj implies that -E[o"i] < oo whenever 
5 > 1. Let k(n) be defined by (?M n )-i < n < au n )- A standard renewal theory argument implies 
that 

kin) 1 

hm = — r. 

n^oo n E[ai\ 

Since Wu n \-i < Ya=1 Vn < Wk(n) an( i nm fc-s>oo Wk/k = E[Wi], this implies that 

n->oo n F\ai\ 
i=i 

Therefore, we obtain the following formula for the limiting speed of transient excited random walks. 
Lemma 2.4. If 5 > 1, then 

(8) V ° ~ E[a 1 +2W 1 Y 

Remark 2.5. The tail decay of W\ in ([7]) implies that £7[PFi] = oo when 5 E (1,2]. However, the 
limiting speed vq = when 5 G (1,2] so that the equality Q still holds in this case. 

3. Large Deviations for the Branching Process 



In this section we discuss the large deviations of n 1 YJi=i Vi- Let 



(9) A w , CT (A,7 / ) = logE 



e - l {cri<oo} 



A,r? G 



be the logarithmic moment generating function of (Wi,ai), and let 

(10) Ay (A) = — sup{?7 : Avt/ )(T (A, rj) < 0} and Iv(%) = sup Xx — Ay (A). 

A 

The relevance of these functions is seen by the following Theorem, which is a direct application of 
a more general result of Nummelin and Ney (see remark (ii) at the bottom of page 594 in [NN87J). 
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Theorem 3.1. Let Ly(x) be defined as in (jlOj). Then, 
limi 

for all open G and any j > 0, and 



lim inf - log P f - V Vi G G, V n = j J > - inf /y(x), 

n->oo n \ n ~[ J xeG 



limsup -logP ( - Y]Vi G P, V n = j ) < - inf Iy(x), 
/or a/Z closed F and any j > 0. 

In order to obtain large deviation results for the related excited random walk, it will also be 
necessary to obtain large deviation asymptotics of n~ l Y^=i Vi without the added condition on the 
value of V n . 



Theorem 3.2. Let Ly(x) be defined as in (10). Then, n 1 ^ satisfies a large deviation 
•principle with rate function Iy(x). That is, 



lim inf 1 logP ( 1 G G I >-inf/v(x), 

n->oo n Vn f-^ / xeG 



/or aZ/ open G, and 

(11) lim sup- log P I 1 W ; G F I <-inf/y(x), 

/or a/Z closed F. 

Remark 3.3. There are many results in the large deviations literature that imply a large deviation 
principle for the empirical mean of a Markov chain. However, we were not able to find a suitable 
theorem that implied Theorem |3.2| Some of the existing results required some sort of fast mixing 
of the Markov chain |BD96l IDZlOj . but the Markov chain {V*} i>o mixes very slowly since if Vq is 
large it typically takes a long time to return to (on the order of 0(Vq) steps). Moreover, it is very 
important that the rate functions are the same in Theorems |3 . 1| and |3 . 2| and many of the results for 
the large deviations for the empirical mean of a Markov chain formulate the rate function in terms 
of the spectral radius of an operator {dA85j instead of in terms of logarithmic moment generating 



functions as in (\9ti and (10). 



Proof. Obviously the lower bound in Theorem 3.2 follows from the corresponding lower bound in 



(3.1), and so it is enough to prove the upper bound only. Our proof will use the following facts 



about the functions Ay and ly. 

(i) Ay (A) is convex and continuous on (— 00, 0] and Ay (A) = 00 for all A > 0. Therefore, 
L v (x) = sup A<0 Ax - Ay (A). 

(ii) Ly(x) is a convex, non-increasing function of x, and lim^oo Iy(x) = ini x Ly{x) = 0. 



These properties and more will be shown in Section 3.1 below where we give a qualitative description 



of the rate function Ly. By property (ii) above, it will be enough to prove the large deviation upper 
bound for closed sets of the form F = (— 00, x\. That is, we need only to show that 

1 ( n \ 

(12) limsup — logP Vj < xn I < —Ly(x), Vx < 00. 

n->oo n \. =l J 
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This will follow from 

(13) lim sup i log J5[e A 2^=1 Vt ] < Ay (A), VA < 0. 



Indeed, combining (13) with the usual Chebyshev upper bound for large deviations gives that for 



any x < oo and A < 0, 



limsup- log P [y^Vi < xn) < limsup - log(e~ A:rn £[e A ^=i Vl ]) < -Xx + A v (X). 



Optimizing over A < and using property Kin above proves (12). 



It remains still to prove (13). By decomposing according to the time of the last regeneration 
before n we obtain 

n n—l 

= E[e^ v >-^)n 1{ai>n}] 



m=l t=0 



n n—l 



+ ^^i?[e^^W-l {ffm=n _ i} 

m=l t=0 
/ n 

a AW m -A v (A) 



(14) 
(15) 



'n-1 



+ x; ^[^- A - (A) ^i { . m <oo } ] E^[ eAELiy! " Av(A)ti i->*}] 



\m=l / \t=0 / 

where we used the Markov property in the second equality. The definition of Ay and the monotone 
convergence theorem imply that Ayt/ CT (A, — Ay(A)) < 0. Therefore, 

n n 



,AW m -A v (A) 



{<7 m <Oo} 



e mA W)tr (A,-A v (A)) < 



n. 



m=l 



m=l 



To bound the second sum in ( 15 ) we need the following lemma, whose proof we postpone for 
now. 



Lemma 3.4. For any A < ; 



Bupi^EU^-AvM! ]<00> 

t>0 



Lemma 3.4 implies the expectation (14) is uniformly bounded in n and that the second sum in 



(15) grows at most linearly in n. Since the first sum in (15) also grows linearly in n this implies 



that 



limsup- log E{e x ^=i v '- Av(x)n ] < 0, VA < 0, 

77,— VOO Tl 



which is obviously equivalent to (13). It remains only to give the proof of Lemma 3.4 
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Proof of Lemma 3.4 First, note that 

1 > E[e xw ^ x ^l {ai<oo} ] > E[e xw ^^l {t<ai 



<oo 



(16) 



E 
E 



° J -{ai>t\ c - - L {cri<oo} 



;AELl^- A v(A)t 1 



Wi>t} 



E' 



C - L {fTl<00} 



where in the last equality we use the notation E m for the expectation with respect to the law 
of the Markov chain Vi conditioned on Vq = m. Since V{ is an irreducible Markov chain and 
E[e XWl ~ Av ^ x ' ai lf CTl<00 \] < 1, then the inner expectation in (16) is finite for any value of Vt and 
can be uniformly bounded below if Vt is restricted to a finite set. Thus, for any K < oo, 



E 



(17) 



{<Tl>t,V t <if} 



< inf E m [e xw ^ A ^l {(T1<oo} }) E[e xw ^ A ^l Wl<oo} }. 



Let Ck a < oo be defined to be the right side of the inequality above. 



Note that the upper bound (17) doesn't depend on t. The key to finishing the proof of Lemma 



3.4 is using the upper bound (17) in an iterative way. For any t > 1 
E[e MZ:iVi-A vW t 1{n>t}] < c K , x + E[e x ^ v ^ x H 



}] 



{ai>t,V t >K 

where in the last inequality we used that {a\ > t, Vt > K} = {o~i > t — 1, Vt > K}. Iterating the 
above bound implies that 

E[e x ^ v >- Av(x H Wl>t} ] < C K ^e l{ - XK - hv W) + e ^-A v (A))_ 

1=0 

By choosing K > Ay(A)/A so that e XK ~ A vW < i ? 

we thus obtain that 
S u P £[e^=^-A v (A) tl ]< C ^ x 

t>0 1 



+ 1 < OO. 



□ 



□ 



3.1. Properties of the rate function ly. We now turn our attention to a qualitative description 
of the rate function ly . Since Iy is defined as the Legendre dual of Ay , these properties will in turn 
follow from an understanding of Ay (and also A\y,a)- We begin with some very basic properties of 
Ay and the corresponding properties of Iy . 

Lemma 3.5. Ay(A) is non- decreasing, convex, and left- continuous as a function of X. Moreover, 

(i) Ay (A) G (logE[w (l)],0) for all A < 0, and lim A _>_ 00 Ay(A) = logE[w (l)]- 

(ii) Ay (A) = oo ifX > 0. 



Proof. Recall the definitions of A]y,a and Ay in ([9]) and (10), respectively. The fact that Ay (A) is 
non-decreasing follows from the fact that A\v,a(Xi, n) < A\y t(T (\2, n) for any Ai < A2. Since A\y^ 
is the logarithmic generating function of the joint random variables {Wi,a\), then A\y ;CT (X, r/) is a 
convex function of (A, rj) (and strictly convex on {(A, n) : A\y,a(X, n) < 00}). The convexity of Ay as 
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a function of A then follows easily from the convexity of A^y j0 -(A, 77) and the definition of Ay. Also, 
left-continuity follows from the definition of Ay and the fact that \ira.\-+\> ^-w,a(^,v) = Aw,o-(A', 77) 
by the monotone convergence theorem. 

Since W\ > 0, A w>a (X,0) = log£[e AW/l l{ CTl<00 }] < for all A < 0. On the other hand, since 
Wi > o\ — 1 it follows that Avy )(7 (A, —A) < —A < 00 for all A < 0. Then the continuity of A.y/, a and 
the definition of Ay (A) imply that Ay (A) < for all A < 0. Additionally, 

E[e xw ^l {ai<oo} ] > e^P(a 1 = 1) = e"E[u, (l)], 
which implies that Avy CT (A, - log E[cj (1)]) > for all A < 0. Thus, Ay(A) > logE[w (l)] for all 



A < 0. To prove the second part of property (i) , note that o\ < W\ + 1 implies that for 77 > 



(18) lim E[e XWl+ ^l Wl<oo} ] < lim E[e^ +r ^ Wl+n ] = e^Pi^l x = 0) = e r 'E[a;o(l)], 

A— >— 00 A — > — 00 

where the second to last equality follows from the bounded convergence theorem. From ( |18[ ) and 
the definition of Ay, it follows that lim^.oo Ay(A) < logE[wo(l)]- Combining this with the first 



part of property (i) implies the second part of property nT) 



To show that Ay (A) = 00 for A > it is actually easiest to refer back to the excited random 
walk. Recall the naive strategy for slowdowns of the excited random walk in Example |1.1| We can 
modify the strategy slightly by not only consuming all cookies in [O,?! 1 / 3 ) and then staying in the 
interval for n steps, but also requiring that the random walk then exits the interval on the right. 

1 /3 

This event still has a probability bounded below by Ce~ cn . Examining the branching process 
corresponding to the excited random walk we see that event for the random walk described above 
implies that Uf > 1 for all i G [1, N - 1], = and J2?=i > n/2, where N = [n 1 / 3 ] . Then, 
using ^ we obtain that that P(W\ > n/2, o\ = [re 1 / 3 ]) > Ce~ cnl/3 for all re > 1 which implies 
that 

E[e XWl+r > ai l Wl<oo} ] > e Xn/2+vnl/3 P(W! > n/2, a x = [n 1 / 3 ]) > Ce Xn/2+rinl/ '- cnl '\ 

for any A > and r/ < 0. Since this lower bound can be made arbitrarily large by taking n — > 00, 
this shows that Aiy,o-(A, rj) = 00 for any A > and rj < 0, and thus Ay (A) = 00 for all A > 0. □ 



We would like to say that Avy )Cr (A, — Ay(A)) = 0. However, in order to be able to conclude this 
is true, we need to show that Ajy )(T (A, rf) £ [0, 00) for some rj. The next series of lemmas gives some 
conditions where we can conclude this is true. 

Lemma 3.6. If X < log E[a;o(l)], then 

(19) Aiy jCT (A,-Ay(A)) = 0. 

Moreover, Ay(A) is strictly convex and analytic on (— 00, logE[o;o(l)])- 



Proof. Since W\ > a\ — 1 we have that 

Aw>(A, -A) = logE[e x ^-^h {ai<oo} ] < e~ x . 

Therefore, Aw^ a (\, rf) < 00 for all A < and rj < —A. On the other hand, it was shown above that 
Aw,ct(A, 0) < and Ajy jCT (A, — logE[o;o(l)]) > when A < 0. Since Avy j0 -(A,r/) is monotone increas- 
ing and continuous in 77 this implies that Ajy j(7 (A, 77) = has a unique solution 77 G [0, — log E[^o(l)]] 
when A < logE[cjo(l)]. By the definition of Ay and the fact that Aiy CT (A, 77) is strictly increasing 
in rj, this must be 77 = —Ay (A). 



Let T>w,(j = {(A, 77) : Ajy,r(A,?7) < 00}. The above argument shows not only that (19) holds but 
also that (A, —Ay (A)) is in the interior of T>w,a when A < logE[^o(l)]- Since Apy CT is analytic on 
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Pvk,ct ; the implicit function theorem implies that Ay (A) is analytic on (— oo, log E[wo(l)])- Finally, 



combining (19) with the fact that Aw ja is strictly convex on T>\y )(J implies that Ay (A) is strictly 



convex on (—00, log E[wo(l)])- □ 

Lemma 3.7. For every m < 00, there exists a Ao = Ao(m) < such that Aw )CT (A, —Am) < 00 for 
all A G (A Q ,0). 

Proof. We need to show that £[e Wl - Wl l {(7l<oo} ] = E[e x ^=i^- m h {(Tl<oo} ] < 00 for A negative 
and sufficiently close to zero. Since A < we need to bound the sum in the exponent from below. 
Note that all the terms in the sum except the last one are larger than — (m — 1) and that the terms 
are non- negative if V{ > m. Therefore, letting N m = #{1 < i < u\ : Vi < m} we obtain that 

E[e xm - Xmai l {ai<oo} ] < E[e- x{m - l ^ N ™). 

To show that this last expectation is finite for A close to zero, we need to show that N m has 
exponential tails. To this end, note that the event {iV m > n} implies that the first n times that 
the process Vi < m, the following step is not to 0. Thus, 

P(N m >n)< (maxP(Vi ^ | V = k)) = P(Vi ^ | V = \m~\ - 1)". 

y k<m J 

Therefore, the statement of the Lemma holds with 



Ao(m) = — — logP(Vi ^ I Vq = H - 1) 
m — 1 



□ 



Corollary 3.8. If 5 > 2 (so that E[Wi], E[ai] < 00), then there exists a X\ < such that on the 
interval (Ai, 0) 

(i) A Wi(J (A,-A y (A)) = 0. 

(ii) Ay (A) is analytic and strictly convex as a function of A. 

(iii) lim A _> 0+ A y (A) = E[Wi]/E[at] =: m . 

Proof Let V\y,a = {(A,??) : A\v ja (X, rf) < 00} be the domain where A^o- is finite, and let D^ Va be 
the interior of Dw,a- Define mo = E[Wi]/ E[a\]. Then, since Avi/,a-(A, n) is monotone increasing in A 
we have that (A, — Amo) G T^wa ^ — Amo < —Xo(m)m for some m > mo- Thus, (A, — Amo) G T^w,a 
if A G (Ai,0) with Ai = inf m>mo Ao(m)m/mo. 

Since Aiy i(7 is analytic and strictly convex in D^ a , the function g{\) = A\y i(T (\, — moA) is strictly 
convex and analytic on the interval (Ai,0). In particular, g is differentiable and 



dX 

Since g is strictly convex, 



_g ^ e A(Wi-m o-i)j 



g'(X) < lim g'(X) = E[Wi-m (Ti] =0, VA G (Ai,0). 

Therefore, g(X) is strictly decreasing on (Ai,0). Since, lim^Q- 9W = = we obtain that 
g(X) = Aiy CT (A, — moA) > for A G (Ai, 0). Thus, for every A G (Ai, 0) there exists an n G (0, —mo A) 
such that Ayi/ j0 -(A, n) = 0, and the definition of Ay implies that r\ = —Ay (A). We have shown that 
Aw,o-(A, — Ay(A)) = and (A, — Ay(A)) G T>^ a for all A G (Ai, 0). As was the case in the proof of 



Lemma 3.6 these facts imply that Ay(A) is analytic and strictly convex on (Ai,0). 
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To show that lim^o- Ay- (A) = 
For m < mo define g m (X) = Ajy i( 
convex and analytic, and that 



mo, first note that as was shown above Ay (A) > toqA for A < 0. 



(A, 



9m ( A ) 



d 

dX 



\ogE 



-mX). For A close enough to we have that g m (X) is strictly 
E [(W\ - ma-\ )c 



= A(Wi— mat) 



A(Wi-mcri)l 



Therefore, lim A ^ - g„(A) = i?[Wi 



£ r e A(Wi-mcri)j 
mE[ai] > 0, and thus there exists a A2 



A2O71) < such 



that 5 m (A) = Ay^ CT (A, — mA) < for A G (A2, 0). This implies that moA < Ay(A) < mX for all 
A G (A2,0), and thus lining- Ay (A) G [m,mo]. The proof is finished by noting that this is true for 
any m < mo- □ 

We are now ready to deduce some properties of the rate function Jy . 
Lemma 3.9. mi x Iy(x) = 0. 

Proof. Since Iy is the Legendre transform of Ay and Ay is lower-semicontinuous, then it follows 
that infa; Iy(x) = Ay(0). When 5 > 0, then A w>a (0,0) = logP(o"i < 00) = (by Lemma K3§ and 
thus Ay(0) = when 5 > 0. 

When 5 < 0, then Avy ;(T (0, 0) = logP(eri < 00) < and so it is more difficult to show Ay(0) = 0. 
Instead we will prove in£ x Iy(x) = in a completely different manner. First note that letting 
F = G = M in Theorem |3.2| implies that 

lim -logP(K = 0) = - inf Iyix). 

n— >oo n x 

Therefore, we need to show that P(V n = 0) doesn't decay exponentially fast in n. The explanation 
of the representation ^ implies that P(T n < TLi) = P(Uq = 0) < P(V n = 0), and thus we are 
reduced to showing that P(T n < T_i) doesn't decay exponentially fast in n. In fact, we claim that 
there exists a constant C > such that 

(20) P(T n < T_i) > Cn- M - 1 

To see this, suppose that the first 2M + 1 steps of the random walk alternate between and 1. The 
probability of this happening is 

"~M+i M 
P{X 2i = 0, X 2l+1 = 1, i = 1, 2, . . . M) = E ' 



j=i 



E 



M 



n^(j) 



E 



M 



na-^o(i)) 



3=1 



> 0. 



At this point the random walker has consumed all the "cookies" at the sites and 1. Therefore, 
by a simple symmetric random walk computation, the probability that the random walk from this 
point hits x = 2 before x = — 1 is 2/3. Since 5 < the random walk will eventually return from 
x = 2 to x = 1 again, and then the probability that the random walk again jumps M more times 
from x = 1 to x = 2 without hitting x = —1 is (2/3) . After jumping from x=ltox = 2a 
total of M + 1 times there are no longer any cookies at x = 2 either, and thus the probability 
that the random walk now jumps M + 1 times from x = 2 to x = 3 without visiting x = — 1 is 
(3/4) M+1 . We continue this process at successive sites to the right until the random walk makes 
M + 1 jumps from x = n — 2 to x = n — 1 without hitting x = —1 (which happens with probability 
((n - l)/n) M+1 ). Upon this last jump to x = n — 1 the random walk has consumed all cookies at 
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x = n — 1 and so the probability that the next step is to the right is 1/2. Putting together the 
above information we obtain the lower bound 



pa), < :/_, )> | 



M 

]>o(j) 



E 



ill 



3=1 




n 



1 



n 



A/+1 



This completes the proof of (20), and thus inf^ Iy(x) = when 5 < 0. □ 
Lemma 3.10. The function Iy (x) is convex, non-increasing, and continuous on [0, oo). Moreover, 

(i) There exists an mi > such that Iy(x) is strictly convex and analytic on (0, m 2 ). 

(ii) ly(0) = — logE[(Jo(l)] and lim :r _ s>0 + ly(x) = — oo. 

(hi) If S > 2 i/ien t/iere exists an mi < too = i£[Wi]/i?[<7i] such that iy(x) is strictly convex 
and analytic on (mi, mo), Iy(x) = for x > mo, and lhn - I'y(x) = so that Iy is 
continuously differentiable on (toi,oo). 

(iv) If 5 < 2 then Iy(x) > for all x < oo. 



Proof. The convexity of Iy follows from the definition of Iy as the Legendre transform of Ay . The 
fact that Iy(x) is non-increasing follows from the fact that Ay (A) = oo for any A > 0. Indeed, if 
xi < X2 then 

Iy(xi) = sup Axi — Ay (A) > sup \X2 — Ay (A) = Iy(x2)- 

A<0 A<0 

Iy is also lower-semicontinuous since it is defined as a Legendre transform, and since it is also 
non-increasing it follows that Iy is continuous on the domain where it is finite. 



Next, recall from Lemma 3.6 that Ay(A) is strictly convex and analytic on (— oo, log E[cjo(1)]) and 
let to-2 = lirnA-^iogE[u> (l)])- Ay (A). The fact that Ay(A) is uniformly bounded below also implies 
that lim^^.oo A'y (A) = 0. Therefore, for every x G (0,m2) there exists a unique A = \(x) such 
that A'y(A(x)) = x and so 

(21) I v (x) = \(x)x- Ky(\(x)) forxG(0,TO 2 ). 

Since Ay (A) is analytic on (— 00, log E[wq(1)]) the inverse function theorem implies that \(x) is 



analytic on (0,777,2) and thus (21) implies that Iy(x) is analytic on (0,to 2 ) as well. To see that 



Iy(x) is strictly convex on (0,7712), we differentiate (21) with respect to x and use the fact that 
A'y(A(x)) = x for x G (0, 7712) to obtain 

(22) I' v (x) = X(x), for x £ (0, to 2 ). 

Since A(x) is strictly increasing on (0, m 2 ), it follows that Iy is strictly convex on (0, TO2). Moreover, 
( [22] ) implies that lim x ^o^ 



I'y(x) 



lim 

M0) = -inf A Ay(A) = -logE[o;o(l)]. 



x— >0- 



\(x) 



00 and Lemma 3.5 (i) implies that that 



When 5 > 2, Lemma |3 . 8| implies that Ay (A) is analytic and strictly convex on (Ai,0). Let mi = 
lim A _^ A + A'y(A) and recall that lim^o- A'y(A) = to,q = i?[Wi]/l£[<7i]. Then the same argument as 



above shows that Iy(x) is strictly convex and analytic on (toi,too) and that lim 



lUx) = 0. 



Now, since A'y(A) increases to too as A — > _ and Ay(0) = 0, then Ay (A) > moA for all A < 0. 
Therefore 

Iy(x) = sup Ax — Ay (A) < sup Atoo — Ay (A) = 0, for all x > too, 

A<0 A<0 

where the first equality follows from the fact that Ay (A) = 00 if A > 0. However, since Iy(x) > 
— Ay(0) = it must be that Iy(x) = for x > ttt-q. 
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It remains only to show that Iv( x ) > for all x when 5 < 2. We will divide the proof into two 
cases: 5 G (1,2] and 6 < 1. 

Case I: 5 G (1,2]. 



For any m < oo let g m {\) = Ajy j(T (A, — mA). Then, as in the proof of Corollary 3.8, g m (\) is strictly 
analytic and convex for A < close enough to zero. Moreover, 

lim g' m {\) = E'fWi — mai] = oo, 

A— >o~ 

where the last equality holds since the tail decay of W\ and o\ in implies that E\W\] = 
oo and E[a±] < oo when 5 G (1,2]. Since <? m (0) = Aty jO -(0,0) = 1 this implies that g m {\) = 
Aw i(T (A, — mA) < for A < sufficiently close to 0, and therefore lim sup^g- Ay(A)/A > m. Since 
this is true for any m < oo and since Ay (A) is convex, it follows that lim A _ 5>0 - Ay(A)/A = oo. Thus, 
for any x < oo there exists a A' < such that Ay (A') < X'x and so Iv(%) > X'x — Ay (A') > 0. 

Case II: 5 < 1. 

As in the case S G (1,2] we could proceed by arguing that g' m (\) — > E[(Wi — m^i)l{ ffl <oo}]' 
However, we would need to then show that this last expectation is infinite, and this would require 
an analysis of the joint tail behavior of (Wi,cri). This could probably be achieved in the case 
S G (0, 1) by adapting the arguments of Kosygina and Mountford in [KM11], however when 5 < 
it would be more difficult since in that case the Markov chain is transient and we would need to 
analyze the tails of o\ conditioned on a\ < oo. It is possible that such an approach would work, 
but we will give a softer argument instead. 

Let Ai(A) = limsup n _ 5-00 1 log E'fe^^^ 1 Vi ]. Then, the standard Chebyshev large deviation upper 
bound implies that 

1 ( n \ 

lim sup — log P \y Vi < xn \ < — sup(Ax — Ai(A)). 



On the other hand, Theorem 3.1 and the fact that Iy is non-increasing implies that 



lim - log P ( \^ V{ < xn, V n = | = -Iy (x). 

Thus, we see that Iy(a;) > sup A<0 (Ax — Ai(A)) for any x < oo. Then, similar to the case 6 G (1, 2] 
above, it will follow that /y(aj) > for all x < oo if we can show that lim^Q- Ai(A)/A = oo. 

Fix an integer K > 1. If A < 0, then A£™ =1 Vi < \KY%=i 1 {V t >K}- Thus, 
(23) E&VU %] < e^d-fln + p{jZ Hv^K) > ^ • 

Recall the construction of the process Vi in Section [2] and define Vi by Vo = and 

Vi + 1=F ^+l 1 {F^ + >KY 

That is, jumps are governed by the same process as the jumps of the Markov chain Vi with the 
exception that any attempted jump to a site in [1,K — 1] is replaced by a jump to 0. Note that 
the above construction of Vi gives a natural coupling with Vi so that Vi < Vi for all i. Let cx^, 
k = 1,2,... be the successive return times to of the Markov chain Vi. Then, since Vi < K implies 
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-XK 




h(-XK) l 



Figure 1. For any fixed A < 0, the supremum in (24) is attained at the intersection 



of the two curves. A lower bound for the supremum is obtained by evaluating the 
line XK{9 - 1) at 9 = h(-XK). 



that Vi = 0, 

p (E >onj<p nr, hvi<K} > e ™) - p (*r<n ^ n )- 

Since is the sum of k i.i.d. random variables, Cramer's Theorem implies that this last probability 
decays on an exponential scale like e _nW a(i/^) j w here 1^ is the large deviation rate function for dk/k. 



Recalling Q, we see that Ai(A) < -min{Aif(0 - 1), 9Ia{l/&)}. Optimizing over (9 G (0,1) 
gives 



(24) 



Ai(A) < 



sup mm{XK(8 

6*6(0,1) 



i), ei s {i/9)}. 



The modified Markov chain Vi inherits the same recurrence/transience properties that Vi has. In 
particular, Vi is null-recurrent if 5 G [0, 1] and transient if 5 < 0. In either case -Efoi] = oo and so 
it can be shown that Ia(x) is convex, non-increasing, and Ia(x) > for x G [l,oo). Therefore, the 
function 9 \— > 91^(1/9) is convex and strictly increasing on (0, 1) and approaches as 9 — > 0. Thus, 
there exists an inverse function h so that h(x)I&(l/h(x)) = x and h(x) — > as x — > 0. We will use 
this information to analyze the upper bound in (24). 



Since the term in the minimum of (24) is decreasing in 9 and the second term in the minimum 



is increasing in 9, the supremum is obtained for the value of 9 that makes the two terms in the 
minimum equal. Thus, the supremum is greater than XK(h(—XK) — 1) (see Figure [I]) which in 
turn implies that Ai(A) < XK(1 — h(—XK)). Therefore, 

Ai(A) 



lim inf 

A-s-o- 



A 



> lim K(l 

A->o- 



h{-XK)) = K. 



Since the above argument works for any finite K, this implies that lim^o- Ai(A)/A = oo. 



□ 



4. Large Deviations for Hitting Times 



The large deviation principles for n 1 "Y^i=i Vi m Theorems 
principle for the hitting times. 



3.1 



and 



3.2 



imply a large deviation 



Theorem 4.1. Let ir(i) = Iv{{t ~ Then, T n /n satisfies a large deviation principle with 

convex rate function ir(i)- That is, 



(25) 



lim inf - logP (T n /n G G) > - inf I v (x), 
n— >oo n xeG 
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for all open G, and 

(26) lim sup- log P(T n /n €F)<- inf I v (x), 

n— >oo fl x£F 

for all closed F. Moreover, the following qualitative properties are true of the rate function It- 

(i) ir(£) is convex, non-increasing, and continuous on [l,oo), and there exists a t<i > 1 such 
that Jr(i) is strictly convex and analytic on (1, ^2) ■ 

(ii) It(1) = — logE[wo(l)] and lim t _ >1 + I' T {t) = —00. 

(iii) If 5 > 2, then It{P) = -s=>- t > 1/vq. Moreover, there exists a t\ < 1/vq such that ir(£) 
is strictly convex and analytic on (ti, 1/vq) and continuously differ entiable on (ii,oo). 

(iv) If 5 < 2, then Jt(£) > for all t < 00 and linn._-.oo ir(i) = 0. 



Proof. The properties of the rate function It follow directly from the corresponding properties of 
ly proved above in Lemmas 3.9 and 3.10 Note that when 5 > 2 we use that the formula for the 
limiting speed of the excited random walk in Q implies that l/vo = E[a\ + 2W / i]/E'[cri] = 1 + 2mo. 

Recall the relationship between the hitting times T n and the processes Vi and ' given in ([5]). 
Then, 



P(T n /n e G) > P I 1 + -J^Vi € G, V n = O) , 



i=l 



since K- 



(n) 



for all * > 1 if V r 



(n) 



= 0. The large deviation lower bound (25) then follows 



from Theorem 13.1 



Since It is non-increasing and inff Jr(i) = 0, as in the proof of Theorem 3.2 the large deviation 
upper bound will follow from 

(27) lim sup - logP(r„ < nt) = -I T (t). 

n— >oo fl 

Again, the relationship between the hitting times T n and the processes Vi and V^ given in §5§ 
implies that 

(t- l)n\ 



P{T n <tn)<P\Y J V l < 
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and Theorem 3.2 implies that (27) holds. 



□ 



To obtain a large deviation principle for the position of the excited random walk we will also 
need a large deviation principle for the hitting times to the left. However, this is obtained directly 



as a Corollary of Theorem 4.1 by switching the direction of the cookie drifts. To be more precise, 
for any cookie environment u = {uji(j)}, let uj = {uJi(j)} be the associated cookie environment 
given by iOi(j) = 1 — coi(j). Let T n be the hitting times of the excited random walk in the cookie 
environment U. An obvious symmetry coupling gives TL n = T n . 

Corollary 4.2. The random variables T_ n /n satisfy a larg e de viation principle with convex rate 
function 1^, where Iif is the rate function given by Theorem 4-1 for the hitting times T n /n. 



Remark 4.3. Since 5 = -EEjf=i(2wo(j) — 1)] = —E[^2j! =1 (2uJo(j) — 1)] = —5, the properties of the 



M 



rate function Ij, are the same as the properties of the rate function It given by Theorem 4.1 when 
5 is replaced by —5. For instance, ijr(i) > for all t < oo if 5 > —2. 



5. Large deviations for the random walk 



In this section will show a large deviation principle for X n /n. We begin by defining the rate 
function Ix{x). 

' xI T {l/x) x>0 

(28) I x ( x ) = )q x = o 

Jx|%(l/|x|) x < 0. 

Before stating the large deviation principle for X n /n we will prove some simple facts about the 
rate function Ix- 

Lemma 5.1. The function Ix is non-negative and continuous on [—1,1] and has the following 
additional properties 



(i) Ix( x ) is non-increasing on [—1,0] and non-decreasing on [0,1]. 

(ii) I X {x) is a convex function. 

(iii) I*(-l) = -k>gE[l-wo(l)] and I x (l) = - log E[wo(l)]. 

(iv) There exist x~2 G (—1,0) and X2 £ (0,1) such that Ix is strictly convex and analytic on 
(-l,x 2 ) and (x 2 , 1). 

(v) \im x ^_i+ I'x(x) = —oo and lim x _ >1 - I' x ( x ) = °°- 

(vi) r x (0)=lim x ^ I x (x)/x = 0. 

(vii) If S £ [—2,2], then Ix(%) = if and only if x = 0. 

(viii) If 5 > 2, then Ix(x) = if and only if x G [0, Vo], and there exists an x\ G (vq, 1) such that 
Ix is strictly convex and analytic on (t>o,xi) and continuously differ entiable on [0,xi). 

(ix) If 5 < —2 then Ix(x) = if and only if x £ [vo,0], and there exists an x\ G (— 1,1*0) 
such that Ix is strictly convex and analytic on {x\,vq) and continuously differentiable on 
(a?i,0].. 



Proof. Most of the properties in the statement of the Lemma follow directly from the correspond- 
ing properties of It (or 1^) given by Theorem 4.1, and thus we will content ourselves with only 
discussing property (ii) from the statement of the Lemma. 
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5 > 2 



S €[-2,2] 

-log£[l-Wd(l)] 




Figure 3. A visual depiction of the rate function Ix in the cases 5 > 2 and <5 € 



-2,2] showing the qualitative properties stated in Lemma 5.1 



It is a general fact of convex analysis that if f(x) is a convex function on [l,oo) then g(x) = 
xf(l/x) is also a convex function on (0, 1]. Therefore, the convexity of It and 1^ imply that Ix is 
convex on [—1,0) and (0, 1], respectively. Next, note that lim 2 ._ s>0 + Ix(x) = lim a ._ >0 + x ^t{^/x) = 
since It is finite and non-increasing, and similarly lim x _+ - Ix(x) = 0. Therefore, Ix is continuous 
at x = which in turn implies that Ix is convex on [—1,0] and [0,1], respectively. Finally, the 
convexity of Ix on all of [—1, 1] follows from the convexity on [—1, 0] and [0, 1] and the monotonicity 
properties in in part (i) of the lemma. □ 



We now are ready to prove the large deviation principle for the position of the excited random 
walk. 



Proof of Theorem \l.l\ Since the rate function Ix is non-increasing on [—1,0), non-decreasing on 
(0, 1], and lx(0) = it is enough to prove the large deviation upper bound for closed sets of the 
form F = [x, 1] with x > or F = [— l,x] with x < 0. To this end, let x > and note that 



{X n > nx} C < n}. Then, Theorem 4.1 implies that 



limsup — logP(X„ > nx) < limsup — log P(T^ nx -\ < n) 

n— too n n - " 



n 



-x! T (l/x), Vx 6(0,1]. 



Similarly, if x < then {X n < nx} C {Tr^n < n} and Corollary 4.2 implies that 



limsup — logP(X n < nx) < limsup — log P(T_r n \ x n < n) 



n 



n 



\x\I T {l/\x\), VxG[-l,0). 



Recalling the definition of Ix(x) in (28) and the monotonicity properties of Ix in Lemma |5.1 
finishes the proof of the large deviation upper bound. 

To prove the large deviations lower bound it is enough to show that 



(29) 



lim liminf — log P(\X n — nx\ < en) > — Ix(x), £ [—1,1], 

£-►0+ n->oo n 



First consider the case where x G (0, 1]. Then, since the random walk is a nearest neighbor walk 

P(\X n - nx\ < en) > P(\T^ - n\ < en - I). 
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Then, Theorem 4.1 implies that for any x G (0, 1], 

1 1 
lim liminf — logP(|X n — nx\ < en) > lim liminf — logPf \Tr„ 

> -xI T {l/x) = -I x (x), 



n\ < en — 1) 



and a similar argument shows that (29) also holds for x G [—1, 0). F inally, to show that (29) holds 
when x = note that the naive slowdown strategy in Example 1.1 implies that P(|X ra | < n 1 / 3 ) > 

en 1 / 3 



Ce 



and thus 



lim liminf 1 log P(\X n \ < en) = = -I*(0). 

£-►0+ rt->oo n 



□ 



6. Slowdowns 



If 5 > 2, then Lemma 5.1 shows that the rate function Ix is zero in the interval [0,t>o]. Thus, 
probabilities such as P(X n < nx) decay to zero sub-exponentially for x G (0,t>o). Similarly, since 
It is zero in [l/-uo,oo) probabilities of the form P(T n > nt) decay sub-exponentially if t > l/t>o. 
The main goal of this section is to prove Theorem |1.3| which gives the correct polynomial rate of 
decay for these probabilities. 



In order to prove Theorem 1.3 we will need the following bound on backtracking probabilities 



for transient excited random walks. 

Lemma 6.1. Let 5 > 1. Then there exists a constant C > such that for any n,r > 1, 

P ( inf X k < n | < Or 1 ' 6 . 
\k>T n+r ) 

Remark 6.2. In |BS08j . Basdevant and Singh showed that such backtracking probabilities could 
be bounded uniformly in n by a term that vanishes as r — > oo. However, their argument uses an 
assumption of non- negativity of the cookie strengths, and their bounds do not give any information 
on the rate of decay of the probabilities in r. Our argument is more general (allowing positive and 
negative cookie drifts) and gives a quantitative rate of decay in r. 

Proof. First, note that 

(30) P ( inf X k < n ) = lim P I inf X k < n 

\k>T n+r ) m->oo \T„ +r <k<T m 

The event {iniT n+r <k<T m X k < n} implies that for every site i G [n + 1, n + r] the excited random 
walk jumps from i to i — 1 at least one time before time T m . Therefore, 



P inf X k < n < P(U™ > 1, Vi G [n + 1, n + r}) 

(31) = P(Vi > 1, Vi G [m — n — r, m — n — 1]). 

Now, the asymptotic age distribution for a discrete renewal process (see Section 6.2 of |Law06j ) 
implies that for any k > 1 

lim P{Vi ^ Ofor all m < i < m + k) = Mi^LZ^Z+l . 

m— !>oo P[criJ 
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Applying this to (30) and (31) we obtain 



Pi inf X h < n < 



K k>T n+r " ~ J ~ E[(Ji] 

The tail decay of ai in ^ implies that when 5 > 1 there exists a constant C > such that 
E[(ai - r)+] < Cr 1 - 5 for any r > 1. □ 



We will also need the following large deviation asymptotics for heavy tailed random variables. 

Lemma 6.3. Let {Zk}k>i be i.i.d. non-negative random variables with P{Z\ > t) ~ Ct~ K for some 
k > 1 and C > 0. Then, 

iogP(ELi^>^) 



lim 

n— >oo 



logn 



1-re, Vx>MZi]. 



Remark 6.4. Lemma 6.3 is not new, but we provide a quick proof here for the convenience of the 



reader since we could not find a statement of this lemma in the literature. 



Proof. The statement of the Lemma follows easily from |Nag79 equation (0.3)] when k > 2. Ineed, 
if k > 2 then in fact 



P I %k > xn \ ~ nP{Z\ > xn) ~ Cx "n 1 K , as n — > oo, 



\fc=i 



for any x > E\Z\\. 



When k G (1,2] it is no longer necessarily true that P(%2k=i ^ k ^ xn ) ~ nP{Z\ > xn) and so 
a different approach is needed. To this end, first note that since the Z^ are non-negative a simple 
lower bound is 

P Z k > xr ^j > p ( 3k < n: Z k > xn) = 1 - (1 - P(Zi > xn)) n . 
Since 1 — (1 — p) n > np + (np) 2 /2 for any n > 1 and p G [0, 1] this implies that 

P Z fc > mj > nP(Z 1 > xn) + ^n 2 P(Z x > xn) 2 ~ Cx" K n 1 - K . 

To obtain a corresponding upper bound when n G (1, 2], note that E[Zj] < oo for any 7 G (0, k). 
Then, |Bri62| implies that P(Y^k=i Z k > xn ) = o(n 1_7 ) for any 7 G (0, k) and any x > E[Z±], and 
this is enough to complete the proof of the lemma. □ 

We are now ready to give the proof of the main result of this section. 



Proof of Theorem \1.3[ We first prove the polynomial rate of decay for the hitting time probabilities 
in (|4|). Since o~k and Wt are sums of k i.i.d. non-negative random variables with tail decay given 



by (|7|), Lemma 6.3 implies that 
(32) 
and 



lim 



1 



(33) 



fe-5>oo log k 
1 



P(a k > ky) = 1 - S, iiy>E[ai], 



lim 

fe-s>oo log k 



P{W k >ky) = l- 5/2, if y > E[W X \. 



22 



JONATHON PETERSON 



(n) 

Recall the relationship between the hitting times T n and the branching processes Vi and 

given in ([5]). Also, note that the branching process V^' starts with V^ = V n and has the same 
offspring distribution as the branching process Vi but without the extra immigrant each generation. 
Thus, = implies that vj n ^ for all j > i and the processes are naturally coupled so that 

Vi < V n +i for alH > 1. Therefore, T n is stochastically dominated by n + 2 Yli=i ) ^ = n + 2W / fc( n )) 
where k(n) is defined by cr k r n \i < n < cry n y Thus, for any c > 

P(T n > nt) < P(k(n) >cn)+P (w lcn} > ^ ~ ^ ) 
(34) < P(a [cnl <n) + P (V LcnJ > ^ ) ■ 

While (32) implies that the right tail large deviations of a k jk decay polynomially, the left tail large 
deviations decay exponentially since a k is the sum of non-negative random variables (use Cramer's 
theorem). That is, 

lim - log P(cr fc < ky) < 0, if y < E[at]. 

k^oo K 

Therefore, if we can choose c such that 1/c < and (i — l)/(2c) > -E^Wi] the first term in (34) 

will decay exponentially in n while the second term will decay polynomially on the order n 1- ^ 2 . 
The assumption that t > l/v = 1 + 2E[Wi]/E[a 1 ] implies that (t - l)/2 > E[W{\/E[<ji] and so 
such a c may be found. 

For a matching lower bound on the polynomial rate of decay of P(T n > nt), we again use the 
relationship between the hitting times and the branching process in ^ to obtain 

P(T n >nt)>p[f J V l > ri{l 




If c < (-E[o"i]) 1 then the assumption that t > 1/vo implies that (t — l)/(2c) > E'fWi], and so (33) 
an d rt32~l ) imply that P(T n > nt) > n lS/2+o(i) _ n i-8+o(i) = n i-5/2+o(i)_ Thig comp i etes the proof 

of ^ 

We now turn to the subexponential rate of decay for P(X n < xn). A lower bound follows 
immediately from Q since P{X n < xn) > P(Tr xn i > n). To obtain a corresponding upper bound, 
note that 

P{X n < xn) < P(T r „ (;c+e)1 > n) + P ( inf X k < xn ) 

\ v fc>T|- n(a . +£)1 J 

(35) <P(T ln(x+£)] >n) + C(n £ ) 1 -' 5 , 

where the last inequality follows from Lemma |6.1| Now, if e > is sufficiently small (so that 
x + e < vq) then Q implies that the probability in (35) is n 1_ */ a+ °W. Since n 1 ' 5 ' 2 is much larger 
than n 1-5 this completes the proof of the upper bound needed for (|3|. □ 
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